Remote Sensing Cross-Modal Text-Image Retrieval Based on Global and Local Information

نویسندگان

چکیده

Cross-modal remote sensing text-image retrieval (RSCTIR) has recently become an urgent research hotspot due to its ability of enabling fast and flexible information extraction on (RS) images. However, current RSCTIR methods mainly focus global features RS images, which leads the neglect local that reflect target relationships saliency. In this article, we first propose a novel framework based (GaLR), design multi-level dynamic fusion (MIDF) module efficaciously integrate different levels. MIDF leverages correct information, utilizes supplement uses addition two generate prominent visual representation. To alleviate pressure redundant targets graph convolution network (GCN) improve model s attention salient instances during modeling features, de-noised representation matrix enhanced adjacency (DREA) are devised assist GCN in producing superior representations. DREA not only filters out with high similarity, but also obtains more powerful by enhancing objects. Finally, make full use similarity inference, come up plug-and-play multivariate rerank (MR) algorithm. The algorithm k nearest neighbors results perform reverse search, improves performance combining multiple components bidirectional retrieval. Extensive experiments public datasets strongly demonstrate state-of-the-art GaLR task. code method, MR algorithm, corresponding files have been made available at https://github.com/xiaoyuan1996/GaLR .

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Content-Based Remote Sensing Image Change Information Retrieval Model

With the rapid development of satellite remote sensing technology, the volume of image datasets in many application areas is growing exponentially and the demand for Land-Cover and Land-Use change remote sensing data is growing rapidly. It is thus becoming hard to efficiently and intelligently retrieve the change information that users need from massive image databases. In this paper, content-b...

متن کامل

Cross-modal Retrieval by Text and Image Feature Biclustering

We describe our approach to the ImageCLEF-Photo 2007 task. The novelty of our method consists of biclustering image segments and annotation words. Given the query words, we may select the image segment clusters that have strongest cooccurrence with the corresponding word clusters. These image segment clusters act as the selected segments relevant to a query. We rank text hits by our own tf.idf ...

متن کامل

Layered Hypernetwork Models for Cross-Modal Associative Text and Image Keyword Generation in Multimodal Information Retrieval

Conventional methods for multimodal data retrieval use text-tag based or cross-modal approaches such as tag-image co-occurrence and canonical correlation analysis. Since there are differences of granularity in text and image features, however, approaches based on lower-order relationship between modalities may have limitations. Here, we propose a novel text and image keyword generation method b...

متن کامل

Integrated Information Mining and Image Retrieval in Remote Sensing

Most existing remote sensing image retrieval systems allow only simple queries based on sensor, location, and date of image capture. This approach does not permit the efficient retrieval of useful information from large image databases. This chapter presents an integrated approach to retrieving spectral and spatial patterns from remotely sensed multiand hyperspectral images using state-of-the-a...

متن کامل

Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems

In query-by-semantic-example image retrieval, images are ranked by similarity of semantic descriptors. These descriptors are obtained by classifying each image with respect to a pre-defined vocabulary of semantic concepts. In this work, we consider the problem of improving the accuracy of semantic descriptors through cross-modal regularization, based on auxiliary text. A cross-modal regularizer...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Geoscience and Remote Sensing

سال: 2022

ISSN: ['0196-2892', '1558-0644']

DOI: https://doi.org/10.1109/tgrs.2022.3163706